SfM-Net: Learning of Structure and Motion from Video

Authors

  • Sudheendra Vijayanarasimhan
  • Susanna Ricco
  • Cordelia Schmid
  • Rahul Sukthankar
  • Katerina Fragkiadaki
Abstract

We propose SfM-Net, a geometry-aware neural network for motion estimation in videos that decomposes frame-to-frame pixel motion in terms of scene and object depth, camera motion, and 3D object rotations and translations. Given a sequence of frames, SfM-Net predicts depth, segmentation, camera and rigid object motions, converts those into a dense frame-to-frame motion field (optical flow), differentiably warps frames in time to match pixels, and back-propagates. The model can be trained with various degrees of supervision: 1) self-supervised by the reprojection photometric error (completely unsupervised), 2) supervised by ego-motion (camera motion), or 3) supervised by depth (e.g., as provided by RGBD sensors). SfM-Net extracts meaningful depth estimates and successfully estimates frame-to-frame camera rotations and translations. It often successfully segments the moving objects in the scene, even though such supervision is never provided.
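The step in which depth and camera motion are converted into a dense motion field can be illustrated with a small NumPy sketch. This is not the authors' implementation: it assumes a pinhole camera with a known intrinsics matrix `K`, handles only a static scene (no per-object motion or segmentation masks), and the function name `rigid_flow` and its parameters are hypothetical names chosen for illustration.

```python
import numpy as np

def rigid_flow(depth, K, R, t):
    """Optical flow induced by camera motion (R, t) on a static scene.

    Assumptions (not from the paper): pinhole intrinsics K (3x3),
    depth of shape (h, w), rotation R (3x3), translation t (3,).
    Returns flow of shape (2, h, w): x-displacement, then y-displacement.
    """
    h, w = depth.shape
    # Homogeneous pixel coordinates, shape (3, h*w).
    u, v = np.meshgrid(np.arange(w, dtype=np.float64),
                       np.arange(h, dtype=np.float64))
    pix = np.stack([u.ravel(), v.ravel(), np.ones(h * w)])
    # Back-project every pixel to a 3D point using its depth.
    points = (np.linalg.inv(K) @ pix) * depth.ravel()
    # Apply the rigid camera motion, then project back to the image plane.
    moved = R @ points + np.asarray(t, dtype=np.float64).reshape(3, 1)
    proj = K @ moved
    uv_new = proj[:2] / proj[2]
    # Flow is the displacement of each pixel between the two frames.
    return (uv_new - pix[:2]).reshape(2, h, w)

# Usage: a fronto-parallel plane at depth 2 and a small sideways translation.
K = np.array([[100.0, 0.0, 2.0],
              [0.0, 100.0, 1.5],
              [0.0, 0.0, 1.0]])
depth = np.full((3, 4), 2.0)
flow = rigid_flow(depth, K, np.eye(3), np.array([0.1, 0.0, 0.0]))
print(flow[0, 0, 0], flow[1, 0, 0])
```

With constant depth d, focal length f, and a pure x-translation t_x, every pixel shifts horizontally by f * t_x / d, so nearer surfaces move more in the image. In a trained model this flow would be used to warp one frame toward the next, and the photometric difference between the warped and observed frames would serve as the self-supervised loss.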


Similar resources

Global Structure-from-Motion and Its Application

Structure-from-motion (SfM) is a fundamental problem in 3D computer vision, with the aim of recovering camera poses and 3D scene structure simultaneously given a set of 2D images. SfM methods can be broadly divided into incremental and global methods according to their ways to register cameras. Incremental methods register cameras one by one, while global SfM methods solve all cameras simultane...


Video Subject Inpainting: A Posture-Based Method

Despite recent advances in video inpainting techniques, reconstructing large missing regions of a moving subject while its scale changes remains an elusive goal. In this paper, we have introduced a scale-change invariant method for large missing regions to tackle this problem. Using this framework, first the moving foreground is separated from the background and its scale is equalized. Then, a ...


Ambiguities in Camera Self-Calibration

Structure from motion (SfM) is the problem of computing the 3D scene and camera parameters from a video or collection of images. SfM can be further classified as calibrated and un-calibrated. In calibrated SfM, the internal camera parameters are known. This is a much easier problem than the un-calibrated case, where these parameters are unknown. Solving for the internal camera parameters are kn...


Review: Recent Structure-From-Motion Algorithms 3D Shape Reconstruction

Existing face recognition systems are based on 2D facial images and exhibit well-known deficiencies. Accordingly, the face recognition research is gradually shifting from classical 2D to sophisticated 3D or hybrid 2D/3D. 3D shape reconstruction from multiview photographs and video sequences (2D images) is an active area of research which can fully leverage the potential of existing 2D image acq...


Real-time Structure from Motion for Augmented Reality

Our work is focused on developing a real-time structure from motion (SfM) algorithm that is usable in an augmented reality system. We envisage augmented reality applications involving a head-mounted camera and display system. This requires an SfM algorithm that is robust to different scene structure and camera motion and will invariably have to deal with the problems of occlusion, clutter and m...



Journal:
  • CoRR

Volume: abs/1704.07804


Publication date: 2017